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1. INTRODUCTION 

Electricity is an extremely important source of energy and plays a significant role in a country’s 
economic development [1]. Load forecasting is necessary for the proper functioning of electrical dispatch 
centers. Load forecasting is a method used to maintain synchronicity of demand and supply of electrical 
power. With a greater contention for the market and greater decentralization, short-term forecasting is 
becoming more significant [2]. In an age where smart grids with advanced sensing and communication are 
fast becoming a reality, load forecasting is a field where the scope and necessity of accuracy are increasing 
day by day [3]. Numerous significant decisions depend upon the load forecasts like economic dispatch, 
distribution schedule, schedule of protection, and maintenance measures [4]. From proper maintenance of 
equipment to the economic strategies of the suppliers, load forecasting has a significant impact [5]. 
Especially for small-scale consumption units, peak load forecasting is very important [6]. Moreover, there 
has been an increased tendency of winters being colder and summers being more extreme than before. 
Therefore, greater use of equipment like air conditioners and heaters, and their use has become even more 
frequent [7]. This has led to more swings in terms of peak load and minimum load. 

Many factors impact electrical load, their interrelation is complex and so is the extent to which one 
factor overrides one another. The factors can be divided into three categories [8]. Climate is considered the 
most important factor [9]. The short-term factors: They are factors that last only a little duration, like a 
sudden weather change. The middle-term factors: They last for a substantial duration and have a distinct 
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characteristic that governs the corresponding load variation. For example, seasonal climatic variations. The 
long-term factors: They last for a significant time period, and usually over multiple forecasting periods [10]. 
For a particular area, the temperature is the measure of the average warmth or coolness of the surrounding. 
Temperature is far more influential than other factors like wind speed and cloud cover [11]. 

When the temperature falls to a certain extent, it becomes cold and households require more energy [12]. 
Similarly, after a temperature rise, more energy is required. Both in summers and winters, there is a strong 
correlating contribution between temperature and load curve. There is positive co-relation for summers, i.e. 
with temperature rise in summer leading to increased consumption of electrical load as appliances such as 
fans, coolers, and air conditioners (ACs), are turned on and if in summer the temperature falls, the same 
appliances are turned off for lesser load consumption. But there is a negative co-relation for winters, as only 
when the temperature falls, appliances used to keep the households warm are used. Generally, it can be seen, 
on working days there are substantial differences in load demands compared to working days and Weekends. 
There’s lower consumption on Tuesdays to Thursdays while on weekends and days closer to weekends such 
as Mondays and Fridays the consumption is higher [13]. Another trend that can be observed is that on 
moving holidays: Holidays which do not have any fixed date, e.g. the religious festivals, also impact the 
forecast. Generally, on days of festivals, the demand is relatively higher. But since industrial activities are 
lesser, the overall consumption prediction becomes difficult. 

In a broad sense, there are two types of models proposed or used for predicting future electrical 
demands, conventional statistical techniques, and artificial intelligence (AI) based techniques. Classic models 
use historical data and process them, and the estimates of parameters in such models can be easily 
interpreted. The models and techniques that fall under this category include auto regressive moving average 
(ARIMA) model [14], the regression seasonal ARIMA generalized autoregressive conditional heteroskedastic 
(Reg-SARIMA-GARCH) model [15], support vector machine models [16]. Time series model for series 
exhibiting multiple complex seasonalities (TBATS) [17]. AI techniques on the other hand prove to be more 
flexible due to their ability to adapt to moving data. The AI functions are nonlinear and nonparametric. In 
general, the AI models yield better results than traditional ones [18]. Neural networks and deep learning 
models have proven to be more accurate for electrical load forecasts than the traditional model. With the 
advent of smart grids and the ever-diversifying application of data analytics, a huge amount of data inflow 
and ever-increasing applications based on their analysis are expected [19], [20]. AI and machine learning 
techniques are expected to find a variety of uses not only in load forecasting but also in theft detection [21], 
protection and safety of nuclear [22] and thermal power plants [23] along with power price determination [24]. 

This paper has attempted to explore the implementation of relatively newer AI techniques in the 
domain of electrical load forecasting. Forecasting for electrical loads is a complex process that is prone to 
slight errors even when utmost care is taken in choosing the methods. This occurs due to the multiple factors 
influencing load patterns. Even such slight errors can lead to grave consequences to power system equipment 
and also gravely impact economic interests. These patterns are sometimes completely independent of each 
other and thus it becomes inherently impossible to find a co-relation. This paper inspects the use of the long 
short term memory (LSTM) model to solve this complex problem. For the same, we have to ensure that the 
data set on which this study is to be done, is organically dynamic and encapsulates the impact of all the 
factors. To achieve this, we use data taken from a major Dispatch Center, State Load Despatch Center 
(SLDC) State Load Dispatch Center, located in Delhi, one of the biggest cities of one of the biggest cities in 
the world in terms of active consumers. This paper, therefore, inspects the applicability of the LSTM model 
in load forecasting over a dynamic consumer base. This can create a platform for further exploration of the 
problem through LSTM using optimizers and supporting mechanisms. LSTM proves to be appreciably viable 
in handling the complex problem that electrical load forecasting presents. 


2. RESEARCH METHOD 

Our focus in this paper was to use the LSTM model to correctly predict the electrical load. LSTM is 
often used for time series forecasting, we chose to test how accurate it is for electrical load forecasting. To 
implement the algorithm on organic and potent data set, we scrapped the site state load dispatch centre, 
Delhi. We scrapped through the data from the 28" of January to the 28" of February. Parameters for each 
epoch of model development and training are presented in Table 1. From Table 1, 20 epochs were taken, with 
a batch consisting of 4600 data points. For cross-validating, 10 epochs were taken. The epochs and batch size 
were decided based on calculations and then approximated by trial and error for the best possible result. After 
the forecasts, root mean square error is used to compare the actual load to the forecast done by the model and 
received satisfactory results. 
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Table 1. Parameters for each epoch of model development and training 


EPOCHS Time taken and Loss Value EPOCHS Time taken and Less Value 
Time per step Loss Time per step Loss 

1 8s 1ms/step 0.0320 0.0190 11 6s 1ms/step 0.0088 0.0074 
2 6s 1ms/step 0.0235 0.0137 12 6s 1ms/step 0.0087 0.0073 
3 6s 1ms/step 0.0160 0.0094 13 6s Ims/step 0.0086 0.0072 
4 6s 1ms/step 0.0112 0.0091 14 6s Ims/step 0.0085 0.0071 
5 6s Ims/step 0.0103 0.0088 15 7s Ims/step 0.0084 0.0070 
6 6s 1ms/step 0.0099 0.0085 16 9s 1ms/step 0.0084 0.0069 
7 6s 1ms/step 0.0096 =: 0.0082 ies 7s 1ms/step 0.0083 0.0069 
8 6s 1ms/step 0.0093 ~—-0.0080 18 7s 1ms/step 0.0083 0.0068 
9 6s 1ms/step 0.0091 0.0078 19 6s Ims/step 0.0082 0.0067 
10 6s_1ms/step 0.0090 _0.0076 20 7s_lms/step 0.0082 0.0067 


2.1. The LSTM model 

Traditional neural networks cannot use the concept of memory. They can’t use the knowledge of 
previous states. This is a major drawback. Recurrent neural networks (RNN) in terms of architecture is not 
that different from the conventional neural network. An RNN however is capable of learning from memory. 
Figure 1 shows traditional neural networks, it is clear that since the output of neural networks doesn’t loop 
back to previous layers, the previous states have no contribution towards future ones. Figure 2 shows a 
simple RNN, with a loop. Recurrent neural networks have proved to be powerful and accurate in their 
application. LSTM is a slightly different kind of RNN that overcomes some shortcomings of the standard 
version. 

LSTM does not suffer from the short-term dependency problem of usual RNNs. Recurrent neural 
networks tend to prove inefficient when data shows more long term dependency than short term. LSTM does 
not have this issue and is considered suitable for time series modelling. LSTMs like RNN have a chain-like 
structure. However, in LSTM the repeating module has a slightly more complex structure. Each module has 4 
layers and each layer interacts with one another. From Figure 3, it can see a cell state, represented by the top 
line running through the entire chain. The cell state only involves a few linear interactions. LSTM repeating 
module can either attach or delete the information running in the cell state. This is achieved through a “Gate”. 
Gate is maintained or changes the cell state. 


Figure 1. Traditional neural network Figure 2. RNN diagram 


Figure 3. Basic structure of LSTM 
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Figure 4 shows how one of the reasons LSTM is different from RNN is the absence of a cell state. 
The cell state is the key to LSTM’s ability to recognize short-term patterns. Figure 5 highlights the initial 
step, which is to determine if the incoming information from the cell state is to be deleted or not. 


Figure 4. RNN does not have the cell states 
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Figure 5. Highlighting a cell state 


The initial step is to determine if the information from an incoming cell state has to be deleted. This 
is determined by the “forget gate layer”. Thereafter, we decide if we have to add any information. This is 
divided into two steps. An “input Gate layer” will determine the values which are to be updated, afterward a 
tan h section that creates a set of value that is to be added in cell state. Hereafter, they are combined to update 
the state. To achieve that, we multiply the old state with Function F(t). And we add to it, It* C~t. Finally, we 
employ a sigmoid layer which determines what will be the output. We now use tanh which restricts values to 
a smaller range. Now we ensure certain selected sections are treated as outgoing output using another gate. 


3. RESULTS AND DISCUSSION 
3.1. Dataset source 

For the data, we have scrapped the site of SLDC, Delhi. The SLDC is responsible for an integrated 
power supply to Delhi. The site updates load data every five minutes. To scrap the data, we have used 
Beautiful soup, a python library used for basic scrapping. It is capable of extracting data from hypertext 
markup language and extensible markup language documents. We took load data for the last month. The load 
data is taken every 5 minutes. Figure 6 shows the load data obtained for the 23" of February 2021. It can be 
seen, the load demanded is lesser in the night and peaks during a time span of 9 to 12 o’clock span. In Figure 7 
we see the variation of load from 28" of January 2021 till 28 of February 2021. It shows the entire 30-day 
period load. 
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Figure 6. Load data on a particular date 
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Figure 7. Variation of load in a particular month 


3.2. Data cleaning and preparation 


Now using seasonal decompose from Python’s Stats model library, we decompose the data (using 
daily frequency as a basis) into trends, seasonality, and residue. Figure 8 shows the results of using seasonal 
decompose, the topmost is the actual observed data, the next section shows the prevalent trend and then the 
regularity of structure is shown by the seasonal part. The next section is the residual part. 
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Figure 8. Observed, trend, seasonal, and residual data 
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3.3. Identifying the trend of the time series data set and detrending 

Figure 8 shows the trend of data, which can be considered to be a time series data set. A trend is 
defined as a regular increase and decrease of values over the mean. The trend present in the data set is of 
stochastic type. Panda et al. [25] argued that detrending can reduce errors in forecasts and improve overall 
performance. Thus removal of this trend can improve the forecasting ability of the model. The removal of a 
trend is called detrending. Detrending must be done with proper methods else becomes detrimental. 
Detrending doesn’t always improve performance, specifically for machine learning applications. However, 
we have chosen to detrend our data because it has proven to be beneficial for time series forecasting [26]. 


3.4. Removing seasonality and rescaling the data 

Seasonality refers to regularly repeating patterns in the data set. Seasonal components tend to obscure 
the actual data pattern that is significant for modeling [27], [28]. In this work, a different method to remove 
seasonality for our data set. Now before the data set can be used for training and fitting into the network, it 
should be scaled down to much lower values so that those processing are faster and more efficient [29], [30]. 
We have scaled down our data to lie between -1 to 1. Figure 9 shows the data after detrending, removal of 
seasonality, and rescaling. 
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Figure 9. Detrended and rescaled data 


3.5. Model training and forecast 

First data is reshaped and treated to start the model training. From Keras layers, we directly invoke 
LSTM and from Keras. Models we invoke sequentially, now we have to decide the epochs and batch 
size [31], [32]. We have decided to run the model training for 30 epochs the batch size has been taken as one. 
After training for 30 epochs and cross-validating as well, the model is ready for making forecasts. Figure 10 
shows the load forecast. Root mean square error (RMSE) or root mean squared error is one of the standard 
error parameters when only two dimensions are involved. In this work, RMSE is selected as it doesn’t get 
affected by the curse of dimensionality. 


Y axis: Load (in W) 
X axis: time 
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Figure 10. Load forecast 
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3.6. Comparison of forecast with actual load 

Figure 11 shows a comparison between forecasts and actual load. The forecast is depicted in orange 
color, while the blue graph represents actual load [33]. It shows appreciable accuracy except for a distinct 
region located left to the middle mark. 

The RMSE evaluation reveals an error of 127 W, which is well within the range of appreciable 
accuracy, is about 4.1% to 3.2% of the observed range of peak load experienced in a day. Thus long short 
term neural network models show significant accuracy in terms of load forecasting. However, further 
accuracy is required to ensure real life application in actual power plants where a difference of 4.1 to 3.2 % 
might imply a difference of the magnitudes of 1000s of KWs. 
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Figure 11. Comparison between actual load and load forecast 


4. CONCLUSION 

LSTM shows appreciable accuracy for electrical load forecasts. It outperforms traditional statistical 
prediction models and also outperforms many earlier used standard RNN methods. However, room for error 
persists. Therefore, LSTM can be enhanced with the addition of other techniques. The LSTM model 
supported by pinball loss results in better performance than a standard one. Another method is to use various 
optimizers to improve the performance of the LSTM network and then using it for the forecasts. LSTMs can 
prove to be quite efficient for load forecasts, especially if further improvements are applied to the model. The 
proposed methodology is subjected to a dynamic data set. From the results it can be concluded that LSTM as 
a single model is relatively more suitable for electrical load forecasting than traditional methods. Moreover, 
its superiority might improve beyond other AI techniques with the use of correct optimizers. 
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